Probabilistic feature-based transformation for speaker verification over telephone networks
نویسندگان
چکیده
Feature transformation aims to reduce the effects of channeland handset-distortion in telephone-based speaker verification. This paper compares several feature transformation techniques and evaluates their verification performance and computation time under the 2000 NIST speaker recognition evaluation protocol. Techniques compared include feature mapping (FM), stochastic feature transformation (SFT), blind stochastic feature transformation (BSFT), feature warping (FW), and short-time Gaussianization (STG). The paper proposes a probabilistic feature mapping (PFM) in which the mapped features depend not only on the top-1 decoded Gaussian but also on the posterior probabilities of other Gaussians in the root model. The paper also proposes speeding up the computation of PFM and BSFT parameters by considering the top few Gaussians only. Results show that PFM performs slightly better than FM and that the fast approach can reduce computation time substantially. Among the approaches investigated, the fast BSFT (fBSFT) strikes a good balance between computational complexity and error rates, and FW and STG are the best in terms of error rates but with higher computational complexity. It was also found that fusion of the scores derived from systems using fBSFT and STG can reduce the error rate further. This study advocates that fBSFT, FW, and STG have the highest potential for robust speaker verification over telephone networks because they achieve good performance without any a priori knowledge of the communication channel.
منابع مشابه
Probabilistic Neural Networks Combined with Gmms for Speaker Recognition over Telephone Channels
In this paper we study the applicability of Probabilistic Neural Networks (PNNs) as core classifiers to medium scale speaker recognition over fixed telephone networks. In particular, banking applications with up to 400 enrolled speakers and short training times are targeted. Two PNN-based open-set text-independent systems for Speaker Identification and Speaker Verification correspondingly are p...
متن کاملStochastic Feature Transformation with Divergence-Based Out-of-Handset Rejection for Robust Speaker Verification
The performance of telephone-based speaker verification systems can be severely degraded by linear and non-linear acoustic distortion caused by telephone handsets. This paper proposes to combine a handset selector with stochastic feature transformation to reduce the distortion. Specifically, a GMMbased handset selector is trained to identify the most likely handset used by the claimants, and th...
متن کاملText-independent Speaker Verification Based on Probabilistic Neural Networks
In this paper, a text-independent Probabilistic Neural Network (PNN)-based Speaker Verification system is presented. Modular structure with a distinct PNN for each enrolled speaker is used. A gender-dependent universal background model is built to represent the impostor speakers. A detailed description of the system, as well as the time required for training and processing all the test trials i...
متن کاملImpostor Modelling Techniques for Speaker Verification Based on Probabilistic Neural Networks
The impact of two different impostor modelling techniques on the performance of a Probabilistic Neural Networks (PNNs)-based text-independent speaker verification system is studied. Depending on the technique used for background codebook construction, two versions of the system are obtained: with one universal background codebook, common to all authorized speakers, and with an individual speake...
متن کاملCluster-Dependent Feature Transformation for Telephone-Based Speaker Verification
This paper presents a cluster-based feature transformation technique for telephone-based speaker verification when labels of the handset types are not available during the training phase. The technique combines a cluster selector with cluster-dependent feature transformations to reduce the acoustic mismatches among different handsets. Specifically, a GMM-based cluster selector is trained to ide...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Neurocomputing
دوره 71 شماره
صفحات -
تاریخ انتشار 2007